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The distribution of the nuclear ground-state spin in a two-body random ensemble (TBRE) was studied using 
a general classification neural network (NN) model with two-body interaction matrix elements as input features 
and the corresponding ground-state spins as labels or output predictions. The quantum many-body system 
problem exceeds the capability of our optimized NNs in terms of accurately predicting the ground-state spin 
of each sample within the TBRE. However, our NN model effectively captured the statistical properties of the 
ground-state spin because it learned the empirical regularity of the ground-state spin distribution in TBRE, as 


discovered by physicists. 
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I. INTRODUCTION 


The atomic nucleus is a complex many-body quantum 
system. Conventionally, a many-body Hamiltonian or La- 
grangian must be constructed based on reliable interactions 
to investigate this complex system. However, such a task is 
usually challenging, as the interactions in many-body prob- 
lems are strongly entangled with the structures. Thus, the 
self-consistency requirement under a certain ansatz leads to a 
vague or inaccurate many-body Hamiltonian. Fortunately, if 
only the regularity and robust properties of many-body sys- 
tems that are independent of the interaction details are re- 
quired, the vagueness of the Hamiltonian provides an alter- 
native perspective, with random numbers as parameters of 
nuclear interactions, that is, by using random interactions to 
statistically probe the robust regularity of nuclei. 

The study of random interactions can be traced back to 
the investigation of Wigner’s random matrix theory (RMT) 
[1], where random numbers were used as the matrix ele- 
ments of the many-body Hamiltonian. The diagonalization 
of these random matrices yields spectral statistical properties 
that agree with experimental data. The spectral properties 
of the RMT were further linked to quantum chaos [2]. In 
the 1970s, Wong, Bohigas, et al. [3-5] introduced the idea 
of randomizing the two-body interaction matrix elements in 
shell-model calculations [6, 7] to quantitatively demonstrate 
the phenomenon of quantum chaos in nuclei [5, 8—11]. The 
shell-model calculations with random interactions generate 
an ensemble of virtual nuclei, which is known as the two- 
body random ensemble (TBRE). A study based on TBRE re- 
vealed that certain robust features of nuclei may be indepen- 
dent of the specific details of the interaction. 

Following this philosophy, Johnson, Bertsch, et al. [12, 13] 
reported a series of robust and interaction-independent statis- 
tical properties of low-lying states in nuclei. A significant 
finding was the "predominance of the spin-zero ground state" 
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in even-even nuclei. Even-even nuclei exhibited a consid- 
erably higher probability of having spin-zero ground states 
compared to the fraction of zero-spin configurations in the en- 
tire shell model space. Subsequently, this phenomenon was 
observed in the interacting boson model (IBM) [14-16]. The 
spin-zero ground states of even-even nuclei are convention- 
ally attributed to the short-range nature of the nuclear force. 
However, in the TBRE, the interactions are entirely random 
and no specific force was predominant. The predominance 
of the spin-zero ground state in the TBRE contradicts the 
conventional understanding of how spin-zero ground states 
emerge from even-even systems. Therefore, considerable ef- 
fort has been devoted to understanding this robust property 
of TBRE, which has proven to be significantly challenging 
and reflects the complexity of the quantum many-body prob- 
lem. Phenomenological attempts include the studies of the 
distribution of the lowest eigenvalues for each spin [14] and 
its width [17], geometric chaos of spin coupling [18], maxi- 
mum and minimum diagonal matrix elements [19], IBM-limit 
of spin distribution in the IBM with TBRE [20-22], wave- 
function properties of different spin ground states [23, 24], 
energy scale features of different spin ground states [25], and 
correlation between the probability of zero-spin ground states 
and the central values of the distribution of two-body matrix 
elements [26]. The probability distributions of various spin 
states as ground states must be mathematically calculated to 
explain this phenomenon. However, nuclear models are typi- 
cally nonlinear systems, which are difficult to apply to statisti- 
cal theories. Therefore, several empirical rules have been pro- 
posed to predict the probability distribution of ground-state 
spins. For example, Kusnezov et al. used the random polyno- 
mial method [24] to a priori determine the probability distri- 
bution for sp bosons and obtained results that were consistent 
with those obtained by Bijker et al. using mean-field methods 
[21, 22]. Chau et al. discussed the cases of d boson systems 
and four fermions in the f7/2 shell, demonstrating the correla- 
tion between specific ground states and the geometric shapes 
determined by nuclear observables, and predicting the proba- 
bilities for the ground-state spin [27]. Zhao et al. suggested 
that the spins of ground states in the TBRE may be associ- 
ated with specific two-body interaction matrix elements and 
thus proposed an empirical approach [28] to predict the dis- 


tribution of ground-state spins. The correlation between the 
ground-state spin and two-body interaction matrix elements 
in this empirical approach is also crucial in our work. 

Because the nonlinearity of the nuclear model is too com- 
plex to overcome, one can take a bypass to explain the origin 
of the predominance of zero-spin ground states. This can be 
achieved by using a sufficiently simple nonlinear to simulate 
the behavior of the shell model and studying the spin determi- 
nation mechanism therein, which may provide more insight 
from a different perspective. The neural network (NN) model 
is a potential candidate for such simulations owing to its pow- 
erful learning, prediction, and adaptation capabilities, which 
have been successfully applied in diverse fields such as lan- 
guage translation, speech recognition, computer vision, and 
even complex physical systems [29-32]. Specifically, NN 
models have been extensively used in nuclear structure stud- 
ies to predict unknown nuclear properties using existing ex- 
perimental data. These properties include the mass [33-35], 
charge radii [36, 37], low-lying excitation spectra [38, 39], 
and 8 decay lifetimes [40]. However, most of these stud- 
ies only utilized the fitting capacity of the NN without fully 
exploring its classification capability for nuclear structure re- 
search. 

In this study, we attempted to distinguish between samples 
with different ground-state spins in the TBRE by adopting 
the classification capability of an NN with supervised learn- 
ing. The adopted NN was trained using the interaction ma- 
trix elements from the TBRE samples as features and the 
ground-state spin as the label. In this process, the NN learnt 
the behavior of the ground state spin in the TBRE, as well 
as the specific correlations between the interaction elements 
and ground state spin, as described in the empirical approach 
[28]. A significant advantage of using the NN in the TBRE 
study lies in the ability of the TBRE to provide nearly infinite 
independent samples for NN training, thereby avoiding over- 
fitting. This enhanced the generalization ability of the NN 
and facilitated the simulation of the shell model production 
of the ground-state spin. We present the performance of the 
NN in predicting the ground-state spins and reproducing their 
distribution in the TBRE. The proposed NN architecture may 
serve as a valuable benchmark for other classification-based 
applications. 


Il. MODEL FRAMEWORK 
A. Two-Body Random Ensemble (TBRE) 


In the TBRE, the nuclear Hamiltonian includes only two- 
body interactions and is expressed as follows: 


H=9 Y Y Clnn Ay Gide) As Gaja) 

J jij2 jaja 
In Eq. (1), G ijai 
two-body interaction; At (j1j2) denotes the creation opera- 
tor of the nucleon pair with two nucleons on the jı and j2 


represents the matrix elements of the 


orbits coupled to total angular momentum J; and Ay (j3j4) 
represents the annihilation operator of the nucleon pair. 

In TBRE, the matrix elements Gy Jai in Eq. (1) are 
independent random numbers that follow a Gaussian distri- 
bution with a probability function 
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to maintain the invariance of the statistical distribution of in- 
teraction matrix elements during an arbitrary single-particle 
transformation. 


B. Classification neural network 


The classification model presented in this paper utilizes an 
NN, which consists of an input layer, one or more hidden lay- 
ers, and an output layer. The structure is illustrated in Fig. 1, 
(with one hidden layer shown as an example). The input layer 
receives the matrix elements of the two-body interactions in 
the shell model, specifically the Gy ja;jają Values in Eq. (1), 
where the number of inputs is equal to the number of inde- 
pendent two-body interaction matrix elements in a specific 
shell-model space. Based on the corresponding input interac- 
tions, the output layer provides the probabilities of different 
spin states being the ground state. The number of outputs 
should equal the number of possible ground-state spins. The 
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Fig. 1. Schematic of the adopted NN classification model. 


activation function used in this model was the rectified lin- 
ear unit (ReLU) function [41], which is explained in Table 2. 
Assuming that the vector 7 = {x;} represents the network 
input, that is, Œ two-body interaction matrix elements in Eq. 
(1), and 7 is the network output, whose elements correspond 
to the probability of each spin being the ground-state spin, the 
relationship (with one hidden layer) can be expressed analyt- 
ically as follows: 
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Here, W = {ax, bkj, Cj, dji} denotes the NN parameter vec- 
tor. 

The output layer introduces the softmax function[42], 
which transforms the unnormalized output values into non- 
negative probability values whose sum is one. 
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This operation preserves the differentiability property of the 
model and the relative order of unnormalized output values. It 
also allows the model output to be interpreted as a probability 
for each class, facilitating the direct interpretation and utiliza- 
tion of these probabilities for classification decisions. There- 
fore, they are frequently employed in NN models to solve 
classification problems. Here, P; is the probability that the 
kth spin is the ground-state spin. Thus, the maximum of Pk 
determines the ground-state spin according to the 7 feature, 
that is, the inputted two-body matrix elements. All the ele- 
ments Pk construct the predicted probability P from the NN 
model. 

To train the NN _~ model,we first 
training set consisting of MN _ samples, D = 
{(1, S1) à (@2,S2),..., (ZN, SN)} out of ~100,000 
shell-model calculations. Here, 7; includes two-body inter- 
action matrix elements in a single shell-model calculation, 
while S; is the corresponding ground-state spin obtained 
from such a shell-model. Secondly, for each S; spin, we 
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created the label P’ vectors, which is a hot-one vector, and 
only include one non-zero elements of value “1,” correspond- 
ing to a 100% probability of S; ground-state spin, and 0% 
probabilities of the rest other spins. Third, we defined the 


loss function to evaluate the similarity between the label Pi 
vector and the NN predicted P* vector from Eq. (5). 


loss( P+, P’) =- Pi logÊj, (6) 


which is a common loss function for training an NN model 
for classification problems. Using the training samples and 
corresponding loss function, we trained our network by ad- 
justing the network parameter vector © with the adaptive mo- 
ment estimation (Adam) optimization algorithm [43] to min- 
imize the sum of the loss functions for all training samples. 
Consequently, an NN model with predictive capabilities was 
developed. 


C. Shell Model Spaces 


We performed approximately 100,000 TBRE calculations 
in six model spaces. These included four valence nucle- 
ons in the f7/2 orbital virtual nuclear (simply expressed as 
(fr /2)*), four valence nucleons in the h4/2 orbital virtual 
nuclear ((h4, /2))s and two, four, and six valence neutrons 
in the sd shell (corresponding to 18, 2°, and 22 Ne nuclear, 
respectively), and six valence neuron in the pf shell (4°Ca 
nuclear). These six model spaces represent various levels of 
many-body complexity. 


In the (f7 j2)* space, the eigenvalues from the shell model 
are simple linear combinations of the two-body interaction 
matrix elements, as shown in Eq. (1.81) [44]. The ground 
state spin corresponds to the lowest eigenvalue associated 
with a specific spin. An NN without a hidden layer corre- 
sponds to linear combinations of the input two-body inter- 
action matrix elements, followed by the application of the 
softmax operation to identify the smallest linear combina- 
tions. Both models employ similar calculation processes to 
determine the ground-state spin, where the weight parame- 
ters. Thus, dj; parameters in Eq. (4) in the NN correspond 
to the cfp coefficients [44] in the shell model, and the soft- 
max input to the NN is equivalent to the energy eigenvalues 
of the shell model. Because a hidden layer complicates the 
NN model and violates the correspondence between the NN 
and shell models, we excluded the hidden layer in our NN 
model for the (f7/2)* space. 

For the (h11 j2)4 and !®Ne spaces, the complexity increases 
beyond the (f7 jo)! space. Some eigenvalues remain lin- 
ear combinations of two-body interaction matrix elements, 
whereas others must be obtained through diagonalization. Al- 
though some diagonalization processes with fewer than 5 are 
analytical, the weight parameters dj; can no longer be related 
to cfp coefficients. Therefore, hidden layers can enhance the 
adaptability of NN models to nonlinear diagonalization [45]. 
In the 2°: 22Ne and *°Ca spaces, the relationship between the 
eigenvalues and cfp coefficients is completely nonlinear. The 
transcendental nature of some of these relationships further 
necessitates hidden layers. Table 1 presents the TBRE sample 
sizes and input-output settings of our nn models for different 
spaces. 


Table 1. Input-output settings for the six model spaces. The input 
and output numbers correspond to the number of two-body interac- 
tion matrix elements and the number of possible ground state spins, 
respectively. 


Model Space Input Output 
(f7/2) 4 5 
(hi1/2)* 6 10 
18Ne 30 5 
20Ne 30 7 
22?Ne 30 8 
46Ca 94 13 


D. Optimization of the network architecture 


For the (fz /2)* space, the shell model eigenvalues were 
linear combinations of two-body interaction matrix elements 
and the softmax input in the NN model. Therefore, both mod- 
els employed similar calculation processes for determining 
the ground-state spin. As mentioned in Section IIC, hidden 
layers were not required. In fact, an NN model without a 
hidden layer has already predicted ground-state spins in the 
(fz/2)* space with up to 98% accuracy. 

For the (hy, jo)" space, because certain eigenvalues dis- 
played nonlinear correlations with the two-body interaction 


matrix elements, incorporating hidden layers into the model 
was essential for enhancing the prediction accuracy. We first 
added one hidden layer and empirically selected 64 hidden 
nodes for the test run. The results demonstrated an accuracy 
of 97%, which was a satisfactory outcome. 

For the remaining four spaces, the arbitrary accuracies 
were not always optimal. Therefore, we attempted to improve 
the prediction accuracy of our NN classification model by in- 
cluding more hidden layers and increasing the number of neu- 
ral nodes in the 18Ne, Ne, ??Ne, and “Ca model spaces. 

First, the prediction accuracy improved when the number 
of neural nodes was doubled, as indicated by the difference 
between the prediction accuracies with N/2 and N neural 
nodes, as shown in Fig. 2. The absence of negative differ- 
ences (Fig. 2) suggests that doubling the number of neural 
nodes consistently improves results, as expected. Further- 
more, the differences reached a peak when N = 32 for all 
four model spaces. For N > 32, the prediction accuracy im- 
proved by only 0 ~ 2%. Considering that more nodes entail 
additional computational overhead, we believe that 32 nodes 
may be an optimal and balanced choice for this study. 
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Fig. 2. (Color online) Difference in the prediction accuracies of 
models employing N/2 and N neural nodes for SNe, ?°Ne, ?*Ne, 
and *°Ca model spaces with a single hidden layer. Thirty-two nodes 
are recommended. 


Furthermore, we investigated the impact of hidden layers 
on the prediction accuracy. By employing 32 neural nodes in 
each layer, as shown in Fig 2, we present the difference in pre- 
diction accuracy between networks with n — 1 and n hidden 
layers against the layer number n in Fig 3 for the !8Ne, ?°Ne, 
22Ne, and 4°Ca model spaces. The accuracy improved signif- 
icantly with a single hidden layer, that is, n = 1. However, 
these improvements diminished with the introduction of ad- 
ditional layers. Because additional layers also consume com- 
putational resources, a single hidden layer may be the optimal 
choice. 

Activation functions [46] play a crucial role in NNs 
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Fig. 3. (Color online) Difference between the prediction accuracies 
of networks employing n and n — 1 hidden layers for the 1°Ne, 
20Ne, ??Ne, and “°Ca model spaces. Thirty-two neural nodes in 
each hidden layer are recommended, as in Fig 2. A single hidden 
layer is recommended. 


by learning abstract features via nonlinear transformations. 
Common activation functions are the sigmoid (also called 
the logistic function), tanh (hyperbolic tangent), and ReLU 
functions. Table 2 presents the impact of different activation 
functions on the prediction accuracy of our NN model. of 
The tanh and ReLU functions exhibit similar model predic- 
tion accuracies for the five model spaces. However, because 
the tanh activation function includes an exponential opera- 
tion, the computational overhead may be larger; therefore, we 
used the ReLU function throughout this study. 


Table 2. Prediction accuracy (%) with three different activation func- 
tions, namely, sigmoid, tanh, and ReLU. All the calculations are per- 
formed with a single 32-node hidden layer NN model. 


Activation function (h11/2) Ne Ne Ne Ca 


Sigmoid 95.36 79.21 66.95 77.22 55.34 
Tanh 96.15 85.10 67.69 78.39 55.67 
ReLU 96.69 86.36 67.88 78.62 55.62 


In summary, the optimal network configuration for the 
18Ne, ?°Ne, ?2Ne, and 4°Ca model spaces considered in our 
analysis comprised one hidden layer with 32 ReLU neural 
nodes. 


Ill. RESULTS AND ANALYSIS 
A. Model comparison 


As shown in Fig. 1, we adopted a fully connected 
NN model. However, considering the recent application of 
Bayesian NNs (BNNs) in nuclear physics [33, 34, 37, 39] 


and the great success of convolutional NNs [47-50] (CNN) 
and recurrent NNs [51-53] (RNN), we compared these four 
networks in terms of accuracy, as shown in Fig 4. 
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Fig. 4. (Color online) Prediction accuracies with different NN mod- 
els. We adopt the classic fully connected NN as demonstrated in Fig. 
1. BNN stands for Bayesian NN; CNN for convolutional NN; RNN 
for recurrent NN. 


The BNN was implemented via Bayesian sampling of 
weights and biases facilitated by a variational inference al- 
gorithm to optimize model training. The model adopted 1000 
iterations to update loss and accuracy. During the prediction 
process, we sampled 1000 times to obtain precise probabil- 
ity prediction results. The CNN included a convolutional, 
pooling, fully connected, and softmax layers. In particular, 
in the convolutional layer, the input channel was the num- 
ber of input features, the output channel was defined as 16. 
The convolution kernel size was specified as three to facili- 
tate feature extraction. Subsequently, in the pooling layer, the 
pooling kernel size and step size were set to two to reduce 
the dimensions of the feature map. The fully connected layer 
mapped the features extracted by the convolutional layer to 
the final classification result based on the feature dimensions 
and category counts of the task. The model parameters were 
adjusted and iteratively optimized throughout the construc- 
tion process to ensure effective feature extraction and classi- 
fication. A remarkable characteristic of the RNN is its ability 
to continuously transmit and share information through recur- 
rent connections. In addition, the network calculation process 
defines the forward propagation function, which encompasses 
the output generated after the input calculation and subse- 
quent prediction through the fully connected layer and soft- 
max function. All four models, including the adopted clas- 
sic softmax model, shared consistent parameters, including a 
training-to-test set ratio of 2:1, a learning rate of 0.01, over 
1000 training epochs, and the use of the ReLU activation and 
Adam optimization functions. 

According to Fig. 4, the CNN performs the worst, while 
the accuracies of both BNN and RNN are similar to that of 
the adopted network. However, the adopted networks had 


faster training speeds and required fewer computational re- 
sources. Therefore, we believe that the adopted network was 
the optimal choice for our study. 


B. Feature selection 


Feature selection is crucial in machine learning and 
data analysis because it enhances model performance, miti- 
gates the risk of overfitting, boosts computational efficiency, 
streamlines model interpretation, and addresses problems re- 
lated to noise and redundant information. This involves con- 
ducting a correlation analysis to assess the relationship be- 
tween each feature and target variable. Subsequently, the fea- 
tures exhibiting a strong correlation with the target variable 
are selected, whereas the others are excluded from further 
training. Considering the nonlinear nature of both our feature 
and label data, we utilized the Spearman correlation coeffi- 
cient p [54] for feature selection as 


6a 
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p=1- (7) 


where d; and n represent the difference in the rank values 
of the i-th data pair and total number of observed samples, 
respectively. 


We calculated the p coefficients for four high-dimensional 
model spaces, namely, 18~??Ne and *°Ca model spaces and 
used different threshold sizes to select input features with 
strong correlation. Only features with p values greater than 
the threshold were retained for further training. In Table 3, 
we list the number of elements of the two-body matrix, that 
is, the number of input features over a certain threshold, and 
the corresponding accuracy. 


Table 3. Model accuracies (%) and input numbers under different 
feature selection thresholds. Threshold 0 indicates no feature selec- 
tion. 


threshold 0.1 0.01 0.001 0 

18Ne accuracy 70 77 85 86 
input number 5 14 28 30 

20Ne accuracy 60 65 66 68 
input number 6 17 25 30 

22Ne accuracy 71 74 76 80 
input number 4 16 25 30 

46Ca accuracy 53 56 56 56 
input number 1 30 80 94 


According to Table 3, the number of inputs after feature 
selection decreases with increasing threshold values, as antic- 
ipated. However, this reduction in the input number implies 
declining performance. Consequently, each input two-body 
matrix element within the four model spaces significantly af- 
fects the output and thus, the exclusion of any of these from 
network training is not recommended. 


C. Accuracy 


Figure 5 presents the evolution of the loss function during 
training. As expected, the loss functions of the six model 
spaces converge, indicating that the network parameters are 
optimal. The loss values of two single-7 model spaces, 
namely, (f7/2)* and (f11/2)*, decrease the most dramatically 
because the single-7 spaces are simpler than the other 4 mod- 
els. Furthermore, “Ca, ?°Ne, ??Ne, and 18N all converge 
with large loss values, corresponding to the unsatisfactory ac- 
curacy described in Table 4. Thus, increasing the number of 
training epochs does not improve the accuracy. 
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Fig. 5. (Color online) Evolution of the loss functions during training. 


Table 4 shows the correlation between the prediction ac- 
curacy (%) of the NN for the ground-state spin and the di- 
mensions of the six model spaces under investigation. The 
shell model eigenvalues of the (f7 IJ space are linear com- 
binations of two-body interaction matrix elements; therefore, 
the NN model is equivalent to linear regression and achieves 
high prediction accuracy of up to 98%. The accuracy is 97% 
for the (hy, /2)* space with one hidden layer; however, some 
eigenvalues in the (h11 ID space exhibit nonlinear relation- 
ships with the two-body interaction matrix elements. 


Table 4. Model space dimensions, prediction accuracy of the NN, 
and consistent rate of the G — J correlations between the SM and 
NN (see Subsection III D for definition). 


Model Space (f7/2)” (hit/2) Ne “Ne “Ne “Ca 
dimension 8 23 14 81 142 3952 
accuracy (%) 98 97 86 68 8 56 


consistency (%) 100 100 100 60 80 74 


For the remaining four spaces, the accuracy decreased sig- 
nificantly with increasing dimensions. We obtained a Pearson 
correlation coefficient [55] of -0.753 between the prediction 
accuracy (%) and dimensions on a logarithmic scale, indicat- 


ing a negative correlation between the two variables. As the 
dimensions of the space and the shell complexity increased, 
the ability of the NN model to predict the ground-state spin 
diminished. As shown in Fig. 2 and 3, introducing additional 
hidden layers or neural nodes does not significantly improve 
the performance of general classification NNs. Thus, to ac- 
curately predict the ground-state spin in the TBRE, the gen- 
eralization capability of the NN is strongly challenged by the 
complexity of the quantum many-body system, and a more 
specialized NN architecture and activation function should be 
designed according to the cfp coefficient property and diago- 
nalization process. 

To obtain a more detailed picture of the prediction perfor- 
mance of the NN model for TBRE samples with a specific 
spin, Fig. 6 presents the confusion matrix for the NN mod- 
els of the six model spaces. In the confusion matrices, the 
y- and x-axes represent the ground-state spin predicted by the 
NN (Iyn) and that obtained from the shell model calculations 
(Ism), respectively. The gray scale indicates the probability 
of the shell model calculation yielding a ground-state spin of 
Ism in the samples for which the NN predicts a ground-state 
spin of Inn. The main diagonal of the confusion matrix ap- 
pears predominantly dark, indicating a reasonably high de- 
gree of consistency between the NN and shell model for a 
specific ground-state spin. From a statistical perspective, the 
NN captures some correlations between the ground-state spin 
and two-body interaction matrix elements of the TBRE. 

The data in Table 4 indicates that the prediction accuracy 
for the ground-state spin of the ?°Ne nucleus is lower than 
that for higher-dimensional ?*Ne. This finding is consistent 
with the observations shown in Fig. 6. Specifically, for ?°Ne, 
the difference in colors between the main diagonal and other 
regions is less pronounced than in other nuclei. This sug- 
gests that the prediction of the ground-state spin in 7”Ne space 
poses greater challenges to the NN, which may be related to 
some special properties of the cfp coefficients of ?°Ne. Fur- 
ther exploration of specific multibody complexity features in 
20Ne space is desirable. 

To further evaluate the statistical performance of the NN 
model, Fig. 7 presents the distribution of ground-spin spins [ 
(Pr) obtained using both the shell model and the well-trained 
NN model with random interactions. The NN model is con- 
sistent with the shell model for all model spaces and partially 
succeeds in capturing the robust statistical properties of the 
TBRE. 


D. G-— I correlation 


To predict P; in TBRE, Zhao et al. proposed a general em- 
pirical approach [19]. In their approach, one of the two-body 
interaction matrix elements was set to -1, whereas the rest 
were set to 0. This interaction was then input into the shell 
model, and the output ground-state spin I was recorded. If N 
independent two-body interaction matrix elements existed in 
the model space, the process was repeated N times, setting 
a different matrix element equal to -1 each time. Finally, the 
number of times the spin J was observed as the ground-state 
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Fig. 6. Confusion matrices for predicting the ground-state spin 
using the NN model in the (fr/2)*, (Rity2)*, 18Ne, 2°Ne, 22Ne, and 
46Ca TBRE calculations. The y- and x-axes represents the ground- 
state spin predicted by the NN (Jn) and that obtained from the shell 
model calculations (Ism), respectively. The gray scale represents 
the probability of the shell model calculation yielding a ground-state 
spin of Ism in the samples for which the NN predicts a ground-state 
spin of Inn. 
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Fig. 7. (Color online) Distribution of the ground-state spin J (Pr) for 
(fr/2)*, (hity2)*, 18Ne, 2°Ne, 22Ne, and *°Ca. The black square 
and red circle represent the Pr obtained from shell model calcula- 
tions with random interactions and that predicted by the NN model, 
respectively. The blue triangle and olive star represent the P; ob- 
tained with the empirical approach [28] applied to the shell model 
and NN model, respectively. 


spin in N numerical experiments was represented as Nz. The 
probability of spin J in the ground state was then estimated 
as follows: 


P; = N/N. (8) 


The empirical approach [19] attributes the "specific spin I 
as the ground-state spin" to a few two-body interaction ma- 
trix elements. If there are more two-body interaction matrix 
elements responsible for the spin J = 0, the empirical rule 
provides a phenomenological explanation for the dominance 
of the ground state with zero spin. 

Note that the empirical approach hints at the correlation be- 
tween the two-body interaction matrix elements and ground- 
state spin, and its correlation determines the ground-state spin 


distribution, as shown in Fig. 7. Thus, the NN model with a 
good prediction of the TBRE ground-state spin distribution 
should also produce a similar correlation between the two- 
body interaction matrix elements (GIL, jgj4) and ground- 
state spin (J) as the shell model. Therefore, the element-spin 
(G—TI) correlations in the shell model must be compared with 


those in the NN model. 


Table 5. Ground state spin(/) from the shell and NN models for the 
(frj2)* and (hits2)* model spaces, with input G7 = —1 for some 
specific J, and other G7 s equal to 0. Here, G denotes the two-body 
interaction matrix element Gj; jj» as defined in Eq. (1). This table 
summarizes the correlation between the two-body interaction matrix 
elements and ground-state spin in the empirical approach. 


(fr/2) (h11/2) 
Gy SM NN SM NN 
G@ 0 0 0 0 
G? 4 4 4 4 
Gt 2 2 0 0 
E 8 8 4 4 
G 8 8 
qi 16 16 


In Tables 5, 6 and 7, the G—J correlations between the two- 
body interaction matrix elements (Gt jovsja defined in Eq. 
(1)) and the ground-state spin (J), obtained from the empirical 
approach applied to both the shell and NN models, are listed 
for the (f7/2)4 and (h11/2)* model spaces, Ne isotopes, and 
46Ca, respectively. 

According to Table 5, in the (fr/2)* and (Aitj2)* model 
spaces, the NN model produces G — I correlations that are 
perfectly consistent with those of the shell model. This ex- 
plains the agreement between the shell and NN models shown 
in Figures 6(a,b) and 7(a,b). In Table 6, this perfect consis- 
tency can also be observed for !8Ne space in coordination 
with Figures 6(c) and 7(c). However, as the dimensions in- 
crease, the consistency of 2° ??Ne in Tables 6 and “°Ca in 
Table 7 gradually decreases. In the ?°Ne space, there are 12 
inconsistent G — I correlations out of 30 (40%) between the 
SM and NN; in ??Ne, six out of 30 (20%) are inconsistent; 
and in 46, 24 out of 94 (~26%) are inconsistent. These incon- 
sistent rates are also correlated with the prediction accuracy 
for different model spaces, as shown in Table 4. 

Furthermore, the empirical approach was also applicable 
to the trained NN model by setting one of the inputs of the 
NN to -1 and the rest to O and recording the ground-state 
spin (J) from the network. This approach also revealed the 
correlation between the interaction matrix elements and the 
predicted ground-state spin, as well as the Pr distribution, 
of the well-trained NN model. Table 5 presents the correla- 
tions between the matrix elements and the spin obtained from 
the shell and NN models. The correlations from both mod- 
els are identical, indicating that our NN model successfully 
learns the G — J correlation, as suggested by the empirical ap- 
proach. Thus, it accurately reproduces the ground-state spin 
of the shell model for simple model. 

With the G — J correlation from such an NN model, we 
counted the ground-spin Is emerging in the G — J correlation 


Table 6. Same as Table 5, except for 18Ne, 20Ne, and ??Ne, with 
Gi, jz;jają aS the matrix elements of the two-body interaction. The 
subscripts jı, j2, j3, j4 are equal to 1, 2, 3, corresponding to 81/2, 
d3/2, and ds5/2 orbits in the sd shell, respectively. J = 0 ~ 4 in 
this table represents the degenerate states with spin 0, 1, 2, 3, and 4 
from the shell model. The inconsistency between the NN and shell 


models is highlighted in bold. 


T8 Ne 20Ne 72Ne 
Gy, jjaja SM NN SM NN SM NN 
Ghia 0 0 0A4 0 0~6 0 
GL iss 0 0 0,2,4 0 0,2,4 0 
Gliss 0 0 0 0 0,2 0 
G222 0 0 02.4 0 0~5 0 
Go033 0 0 0 0 0 0 
G3333 0 0 0 0 0~2 0 
Gisis 1 1 1 0 0 0 
Goo 1 1 2 0 0 0 
eres 1 1 0 0 0 3 
Gizie 2 2 0,2 0 0 0 
Bois 2 2 2 0 2 2 
Gass 2 2 1nd 0 0~6 0 
en 2 2 0 0 0 2 
Goo33 2 2 0 0 0 0 
Gi313 2 2 4 2 0,2,4 2 
(er 2 2 0 0 0 0 
G o3 2 2 0 2 0 0 
Gi333 2 2 2 2 0~4 2 
G3o09 2 2 0 0 0,24 0 
(ea 2 2 2 0 2,3 0 
G3o33 2 2 0 0 0 0 
G3a03 2 2 2 0 0 0 
G3.533 2 2 0 0 0 0 
G3.533 2 2 2 0 0 0 
G3313 3 3 5 2 0,2,4 3 
Gisa 3 3 4 0 3 0 
GB503 3 3 0 0 0 0 
G303 4 4 6 6 6 6 
Gasas 4 4 4 0 2,3 0 
G3333 4 4 4 4 0 0 


and then normalized them to the Py distribution, as guided by 
the empirical approach. The Py distributions based on the em- 
pirical approach using the NN model are shown in Fig. 7. The 
empirical approach from both the SM and NN models yields 
reasonably consistent Py distributions for all model spaces in 
this case, suggesting that the NN may effectively capture the 
correlation between the two-body interaction matrix elements 
and the ground-state spin, which further explains its remark- 
able performance in reproducing the statistical properties of 
the ground-state spin in the TBRE. 


IV. CONCLUSION 


This study used an NN model to investigate the distribution 
of the ground-state spin in the TBRE. Using a softmax classi- 
fication NN model, we attempted to reproduce the correlation 
between the matrix elements of the interaction and ground- 


Table 7. Same as Table 6, except for “Ca. The subscripts 
Jı, j2, J3, ja are equal to 1, 2, 3, 4, corresponding to the p1/2, 
13/2) f5/2, and f7/2 orbits in the pf shell, respectively. 


Gir josjsga SM NN Gia jorjaga SM NN 
Cras 0~10 0 Gozi 0 0 
Ot ae 0~10 0 G3a44 0 0 
Goss 0,2~6 0 G2 isi 4 0 
Goa 0~4 0 Gaua 0 0 
e 0~10 0 G u 2. 2 
Goo33 0,2,46 0 Gees 0,2,4 3 
Geass 0 0 Gai 3 0 
G3333 0~6 0 Geos 3 0 
G$344 0 0 Gea04 4 0 
Gass 0~4 0 G3 534 4 0 
Ghote 0 0 G ia 0,2,4~8 2 
Gł223 0 0 G3ios 0 0 
Ginga 0,9 0 GB roa 0 0 
G3323 0 0 GB asa 0 0 
G3334 0 0 G3303 3 0 
G3a34 1,8 0 G3a04 0 2 
Giois 0 0 G3334 0 0 
Gros 2 0 G3yoa 0 0 
e 0~10 0 Cisi 0 0 
Go s3 0,4,6 0 G ii 0,10 0 
Gino 0 0 Gtata 0,2,4~6,8 8 
Gj233 0 0 Gi423 2 0 
Grose 0,9 0 Gaza 6 0 
Gloss 0 0 Giazs 0 0 
Giss 0,2,4 2 Qiss 1 0 
G aao 0,2,4,6 0 e 0~4 0 
Gis03 0 0 G5323 6 0 
Gîs24 0 0 Go304 2 0 
Giaa 0~8 0 G55 1X6 0 
Gissa 2 0 G3334 0,9 0 
Grass 2 0 Go344 4 0 
G3o00 0~6 0 Goa24 0 0 
Gbo03 1,2,4,5 0 Qiiss 0 0 
G3oo4 0 0 Goa34 3 0 
Gbo33 0,2~4,6 0 G ui 0,2~4 0 
Goo34 0 0 G3333 0 0 
Goad 0,2~4 0 Gissi 0 0 
Q23 0 0 G3344 0,10 0 
G3aca 0 0 G3a34 0 0 
G233, 0,2~4,6 0 G3444 0 0 
Gsi 0,10 0 Gija 4 4 
G3oaa 0 0 GŽ424 10 9 
G3ao4 0,9 0 GŽ434 0 0 
Gĉ433 0 0 G3434 1 0 
G3434 0 0 Bias 12 10 
G3 aaa 0 0 Gaua 0 0 
G3333 0 0 Gass 6 6 


state spin, as labeled by the shell model, for the TBRE. The 
reliability of the NN model was analyzed based on its predic- 
tion accuracy and consistency with the empirical rules of the 
Py distribution. 

Previous applications of NN models in nuclear physics pri- 
marily focused on their strong fitting capabilities. However, 
the analysis of the ground-state spin distribution in TBRE 


demonstrated the classification ability of the NN, which is 
rare in previous nuclear physics research. Furthermore, 
TBRE provided extensive samples for training NNs, thereby 
potentially enhancing the performance of the NN model. 

In our investigation, we adopted various strategies to en- 
hance network performance, including the introduction of 
BNN, CNN, and RNN, feature selection, and adjusting the 
number of neural nodes and hidden layers. However, none 
of these approaches yielded significant improvements with 
limited computational resources. Therefore, we must ac- 
knowledge that the quantum many-body problem remains a 
formidable challenge for NN models. Addressing this chal- 
lenge may necessitate the further development of NN archi- 
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